Attorney's Docket No.: 34874-1 65-UTL / 2003P00147US 

Customer No. : 64280 

Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims: 

1 . (Currently Amended) A method for indexing documents in a collection of 
documents, each document comprising one or more index terms, the method 
comprising: 

determining a first value x representative of a first location of a first skip entry in 
an inverted index such that x is determined as an integer corresponding to a quantity of 
documents including at least a majority of the index terms occur i n x docum e nts or 
f e w e r, wh e r o x i s an i nt e g e r ; 

determining a second value y, where y does not exceed i s not e qua l to x, the 
second value v determined as an integer and representative of a second location of a 
second skip entry in the inverted index wh e r e v is an i nt e g e r : 

generating [[an]] the inverted index for the collection of documents, the inverted 
index including an inverted list for each of the index terms, each inverted list including at 
least one posting and, if the number of postings exceeds x, further including [[a]] the first 
skip entry after the x th posting and one or more second skip entries thereafter at 
intervals of every y th posting; 

wherein: 
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[[a]] the at least one posting includes a document identifier identifying a 
document in the collection of documents; 

[[a]] the first and second skip entfy entries including i nclud e s a document 
identifier that is included in a boundary posting of a block of postings immediately 
adjacent to the skip entry in the inverted list, where a block of postings includes postings 
having document identifiers ranging from a lower to an upper value and where a 
boundary posting is a posting having a document identifier of either the lower or the 
upper value. 

2. (Previously Presented) The method of claim 1, wherein each posting 
further includes a position identifier identifying a position of the index term in the 
document, and a frequency of the index term occurring in the document. 

3. (Currently Amended) The method of claim 1 , wherein the boundary 
posting includes a document identifier having the lower value in the range of values and 
the block of postings follow the first skip entry in the inverted list. 

4. (Currently Amended) The method of claim 3, wherein the first skip entry 
further includes information to locate the next skip entry in the inverted list. 

5. (Currently Amended) The method of claim 1 , wherein the boundary 
posting includes a document identifier having the higher value in the range of values 
and the block of postings precede the first skip entry in the inverted list. 

6. (Currently Amended) The method claim 5, wherein the first skip entry 
further includes information to locate the next skip entry in the inverted list. 



7. (Canceled) 
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8. (Currently Amended) The method of claim 1 , wherein x is selected from m 
a first range of 256 to 512 and y is selected from m a second range of 128 to 256 A 
wherein v is not selected to have the same value as x . 

9. (Original) The method of claim 1 , wherein the collection of one or more 
documents includes one or more binary files, data tables, source code files, text 
documents or combinations thereof. 

10. (Original) The method of claim 1, further comprising: 
compressing the inverted index. 

1 1 . (Original) The method of claim 1 , wherein substantially all of the index 
terms occur in x documents or fewer. 

12. (Previously Presented) The method of claim 1 1 , wherein at least 
approximately 80% of the index terms occur in x documents. 

13. (Original) The method of claim 1 , wherein for each inverted list, if the 
number of postings exceeds x, further including a skip entry before the first posting in 
the inverted list. 

14. (Original) The method of claim 1 , wherein for each inverted list, if the 
number of postings exceeds x, further including a skip entry after the last posting in the 
inverted list. 
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15. (Currently Amended) A method for indexing documents, the method 
comprising: 

receiving a collection of documents, each document comprising one or more 
index terms; 

determining a first value x representative of a first location of a first skip entry in 
an inverted index , wherein at least a majority of the index terms occur [[in]] within x 
documents or fowor and x is an integer in a first range of 256 to 512; 

determining a second value y representative of a second location of a second 
skip entry in the inverted index , wherein y does not exceed the value x and is an integer 
in a second range of 128 to 256; and 

generating [[an]] the inverted index for the collection of documents, the inverted 
index including an inverted list for each of the index terms, each inverted list including at 
least one posting and, if the number of postings exceeds x, further including [[a]] the first 
skip entry after the [[x th ']] x^ posting and one or more second skip entries thereafter at 
intervals of every y th posting; 

wherein: 

[[a]] the at least one posting includes a document identifier identifying a 
document in the collection of documents; 

[[a]] the first and second skip e ntry i nc l ud e s entries include a document identifier 
that is included in a boundary posting of a block of postings immediately adjacent to the 
skip entry in the inverted list, where a block of postings includes postings having 



TRA 2251918v.l 



-5- 



Attorney's Docket No.: 34874-1 65-UTL / 2003P00147US 

Customer No. : 64280 

document identifiers ranging from a lower to an upper value and where a boundary 
posting is a posting having a document identifier of either the lower or the upper value. 

16. (Previously Presented) The method of claim 15, wherein each 
posting further includes a position identifier identifying a position of the index term in the 
document, and a frequency of the index term occurring in the document. 

17. (Currently Amended) The method of claim 15, wherein the 
boundary posting includes a document identifier having the lower value in the range of 
values and the block of postings follows the first skip entry in the inverted list. 

1 8. (Currently Amended) The method of claim 1 7, wherein the first skip 
entry further includes information to locate the next skip entry in the inverted list. 

19. (Currently Amended) The method of claim 15, wherein the 
boundary posting includes a document identifier having the higher value in the range of 
values and the block of postings precede the first skip entry in the inverted list. 

20. (Currently Amended) The method claim 15, wherein the first skip 
entry further includes information to locate the next skip entry in the inverted list. 

21 . (Original) The method of claim 1 5, wherein substantially all of the 
index terms occur in x documents or fewer. 

22. (Original) The method of claim 21 , wherein approximately 80 to 
90% of the index terms occur in x documents or fewer. 

23. (Original) The method of claim 15, wherein for each inverted list, if 
the number of postings exceeds x, further including a skip entry before the first posting 
in the inverted list. 

24. (Original) The method of claim 15, wherein for each inverted list, if 
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the number of postings exceeds x, further including a skip entry after the last posting in 
the inverted list. 

25. (Currently Amended) An inverted index for a collection of 
documents, each document comprising one or more index terms, the inverted index 
comprising: 

an inverted list for each index term in the collection of documents; and 

one or more inverted lists including a quantity of postings that exceeds a value x, 
a skip entry after the x th posting, and one or more additional skip entries thereafter at 
intervals of every y th posting, where the value x is determined as an integer 
corresponding to a quantity of documents including such that at least a majority of the 
index terms occur in w i th i n x docum e nts or f o w o r , and the value y is determined as an 
integer and does not exceed the value x and x and y ar e i nt e g e rs ; 

wherein: 

a posting includes a document identifier identifying a document in the collection 
of documents; 

a skip entry includes a document identifier that is included in a boundary posting 
of a block of postings immediately adjacent to the skip entry in the inverted list, where a 
block of postings includes postings having document identifiers ranging from a lower to 
an upper value and where a boundary posting is a posting having a document identifier 
of either the lower or the upper value. 
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26. (Previously Presented) The inverted index of claim 25, wherein 
each posting further a position identifier identifying a position of the index term in the 
document, and a frequency of the index term occurring in the document. 

27. (Currently Amended) The inverted index of claim 25, wherein the 
boundary posting includes a document identifier having the lower value in the range of 
values and the block of postings follow the first skip entry in the inverted list. 

28. (Currently Amended) The inverted index of claim 27, wherein the 
first skip entry further includes information to locate the next skip entry in the inverted 
list. 

29. (Currently Amended) The inverted index of claim 25, wherein the 
boundary posting includes a document identifier having the higher value in the range of 
values and the block of postings precede the first skip entry in the inverted list. 

30. (Currently Amended) The inverted index of claim 29, wherein the 
first skip entry further includes information to locate the next skip entry in the inverted 
list. 

31 . (Currently Amended) The inverted index of claim 25, wherein x is 
selected from in th o a first range of 256 to 512 and y is selected from i n th e a second 
range of 128 to 256 , wherein v cannot be selected to have the same value as x . 

32. (Original) The inverted index of claim 25, wherein substantially all of 
the index terms occur in x documents or fewer. 

33. (Previously Presented) The inverted index of claim 32, wherein at 
least approximately 80% of the index terms occur in x documents. 
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34. (Original) The inverted index of claim 25, wherein the collection of 
one or more documents includes one or more binary files, data tables, source code 
files, text documents or combinations thereof. 

35. (Original) The inverted index of claim 25, wherein the one or more 
inverted lists further include a skip entry before the first posting in the inverted list. 

36. (Original) The inverted index of claim 25, wherein the one or more 
inverted lists further include a skip entry after the last posting in the inverted list. 

37. (Currently Amended) An article comprising a machine-readable 
medium storing instructions operable to cause one or more machines to perform 
operations comprising: 

determining a first value x representative of a first location of a first skip entry in 
an inverted index such that x is determined as an integer corresponding to a quantity of 
documents including at least a majority of the index terms occur i n x documents or 
f e w e r, wh e r e x i s an int e g e r ; 

determining a second value y, where y does not exceed i s not e qua l to x, the 
second value v determined as an integer and representative of a second location of a 
second skip entry in the inverted index wh o r o v is an integer : 

generating [[an]] the inverted index for the collection of documents, the inverted 
index including an inverted list for each of the index terms, each inverted list including at 
least one posting and, if the number of postings exceeds x, further including [[a]] the first 
skip entry after the x th posting and one or more second skip entries thereafter at 
intervals of every y th posting; 

wherein: 
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[[a]] the at least one posting includes a document identifier identifying a 
document in the collection of documents; 

[[a]] the first and second skip errtfy entries including i nclud e s a document 
identifier that is included in a boundary posting of a block of postings immediately 
adjacent to the skip entry in the inverted list, where a block of postings includes postings 
having document identifiers ranging from a lower to an upper value and where a 
boundary posting is a posting having a document identifier of either the lower or the 
upper value. 

38. (Previously Presented) The article of claim 37, wherein each 
posting further includes a position identifier identifying a position of the index term in the 
document, and a frequency of the index term occurring in the document. 

39. (Currently Amended) The article of claim 37, wherein the boundary 
posting includes a document identifier having the lower value in the range of values and 
the block of postings follow the first skip entry in the inverted list. 

40. (Currently Amended) The article of claim 39, wherein the first skip 
entry further includes information to locate the next skip entry in the inverted list. 

41 . (Currently Amended) The article of claim 37, wherein the boundary 
posting includes a document identifier having the higher value in the range of values 
and the block of postings precede the first skip entry in the inverted list. 

42. (Currently Amended) The article of claim 41 , wherein the first skip 
entry further includes information to locate the next skip entry in the inverted list. 

43. (Canceled). 
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44. (Currently Amended) The article of claim 37, wherein x is selected 
from in-tbe a first range of 256 to 512 and v selected from m-the a second range of 128 
to 256 , wherein v is not selected to have the same value as x . 

45. (Original) The article of claim 37, wherein the collection of one or 
more documents includes one or more binary files, data tables, source code files, text 
documents or combinations thereof. 

46. (Original) The article of claim 37, further comprising instructions 
operable to cause one or more machines to perform operations comprising: 

compressing the inverted index. 

47. (Original) The article of claim 37, wherein substantially all of the 
index terms occur in x documents or fewer. 

48. (Previously Presented) The article of claim 47, wherein at least 
approximately 80% of the index terms occur in x documents. 

49. (Original) The article of claim 37, wherein for each inverted list, if 
the number of postings exceeds x, further including a skip entry before the first posting 
in the inverted list. 

50. (Original) The article of claim 37, wherein for each inverted list, if 
the number of postings exceeds x, further including a skip entry after the last posting in 
the inverted list. 

51 . (Currently Amended) An article comprising a machine-readable 
medium storing instructions operable to cause one or more machines to perform 
operations comprising: 
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receiving a collection of documents, each document comprising one or more 
index terms; 

determining a first value x representative of a first location of a first skip entry in 
an inverted index , wherein at least a majority of the index terms occur [[in]] within x 
documents or f e w e r and x is an integer in a first range of 256 to 512; 

determining a second value y representative of a second location of a second 
skip entry in the inverted index , wherein y does not exceed the value x and is an integer 
in a second range of 128 to 256; and 

generating [[an]] the inverted index for the collection of documents, the inverted 
index including an inverted list for each of the index terms, each inverted list including at 
least one posting and, if the number of postings exceeds x, further including [[a]] the first 
skip entry after the x th posting and one or more second skip entries thereafter at 
intervals of every y th posting; 

wherein: 

[[a]] the at least one posting includes a document identifier identifying a 
document in the collection of documents; 

[fall the first and second skip e ntry i nclud e s entries include a document identifier 
that is included in a boundary posting of a block of postings immediately adjacent to the 
skip entry in the inverted list, where a block of postings includes postings having 
document identifiers ranging from a lower to an upper value and where a boundary 
posting is a posting having a document identifier of either the lower or the upper value. 
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52. (Previously Presented) The article of claim 51 , wherein each 
posting further includes a position identifier identifying a position of the index term in the 
document, and a frequency of the index term occurring in the document. 

53. (Currently Amended) The article of claim 51 , wherein the boundary 
posting includes a document identifier having the lower value in the range of values and 
the block of postings follow the first skip entry in the inverted list. 

54. (Currently Amended) The article of claim 53, wherein the first skip 
entry further includes information to locate the next skip entry in the inverted list. 

55. (Currently Amended) The article of claim 51 , wherein the boundary 
posting includes a document identifier having the higher value in the range of values 
and the block of postings precede the first skip entry in the inverted list. 

56. (Currently Amended) The article of claim 51 , wherein the first skip 
entry further includes information to locate the next skip entry in the inverted list. 

57. (Original) The article of claim 51 , wherein substantially all of the 
index terms occur in x documents or fewer. 

58. (Original) The article of claim 57, wherein approximately 80 to 90% 
of the index terms occur in x documents or fewer. 

59. (Original) The article of claim 51 , wherein for each inverted list, if 
the number of postings exceeds x, further including a skip entry before the first posting 
in the inverted list. 

60. (Original) The article of claim 51 , wherein for each inverted list, if 
the number of postings exceeds x, further including a skip entry after the last posting in 
the inverted list. 
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